Found another instruction optimization that saved two cycles each product